Performance Bounds for Column-Block Partitioning of Parallel Gaussian Elimination and Gauss-Jordan Methods

Author

  • Tao Yang
Abstract

Column-block partitioning is commonly used in the parallelization of Gaussian elimination (GE) and Gauss-Jordan (GJ) algorithms. It is therefore of interest to know the performance bounds of such partitioning on scalable distributed-memory parallel architectures. In this paper, we use a graph-theoretic approach to derive asymptotic performance lower bounds of column-block partitioning for both GE and GJ. The new contribution is the incorporation of communication cost in the analysis, which results in the derivation of sharper lower bounds. We use our scheduling system PYRROS to experimentally compare the actual run-time performance with that derived from these lower bounds on the nCUBE-2 hypercube parallel machine.

Note: The content of the information herein does not necessarily reflect the position of the Government, and official endorsement should not be inferred.

Similar resources

Gauss-Jordan Elimination for Matrices Represented as Functions

This theory provides a compact formulation of Gauss-Jordan elimination for matrices represented as functions. Its distinctive feature is succinctness. It is not meant for large computations. Section 1, Gauss-Jordan elimination algorithm: theory Gauss-Jordan-Elim-Fun imports Main begin. Matrices are functions: type-synonym ′a matrix = nat ⇒ nat ⇒ ′a. In order to restrict to finite matrices, a matrix is usua...

Stability of Gauss-Huard Elimination for Solving Linear Systems

This paper considers elimination methods to solve dense linear systems, in particular a variant of Gaussian elimination due to Huard [13]. This variant reduces the system to an equivalent diagonal system just as Gauss-Jordan elimination does, but does not require more floating-point operations than Gaussian elimination. Huard's method may be advantageous for use in computers with hierarchical memory,...

Singly-Bordered Block-Diagonal Form for Minimal Problem Solvers

The Gröbner basis method for solving systems of polynomial equations became very popular in the computer vision community as it helps to find fast and numerically stable solutions to difficult problems. In this paper, we present a method that potentially significantly speeds up Gröbner basis solvers. We show that the elimination template matrices used in these solvers are usually quite sparse a...

Measuring the Overhead of Intel C++ CnC over TBB for Gauss-Jordan Elimination

The most efficient parallel execution of dense linear algebra algorithms is to build and evaluate the task graph constrained only by the data dependencies between the tasks. Both Intel C++ Concurrent Collections (CnC) and Threading Building Blocks (TBB) libraries allow such task-based parallel programming. In this paper, we first analyze all three types of data dependencies in the tiled in-p...

Minimizing communication overhead for matrix inversion algorithms on hypercubes

The main contribution of this report is the development of novel algorithms that make efficient use of the communication system in distributed memory architectures with processing elements interconnected by a hypercube network. These algorithms achieve almost optimal overlap of communication delays by computation, leading to a minimization of communication overhead. Rigorous analytical ...

Publication date: 2007